Sequence Mining and Pattern Analysis in Drilling Reports with Deep Natural Language Processing

نویسندگان

  • Júlio Hoffimann
  • Youli Mao
  • Avinash Wesley
  • Aimee Taylor
چکیده

Drilling activities in the oil and gas industry have been reported over decades for thousands of wells on a daily basis, yet the analysis of this text at large-scale for information retrieval, sequence mining, and pattern analysis is very challenging. Drilling reports contain interpretations written by drillers from noting measurements in downhole sensors and surface equipment, and can be used for operation optimization and accident mitigation. In this initial work, a methodology is proposed for automatic classification of sentences written in drilling reports into three relevant labels (EVENT, SYMPTOM and ACTION) for hundreds of wells in an actual field. Some of the main challenges in the text corpus were overcome, which include the high frequency of technical symbols, mistyping/abbreviation of technical terms, and the presence of incomplete sentences in the drilling reports. We obtain state-of-the-art classification accuracy within this technical language and illustrate advanced queries enabled by the tool.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pattern Mining with Natural Language Processing: An Exploratory Approach

Pattern mining derives from the need of discovering hidden knowledge in very large amounts of data, regardless of the form in which it is presented. When it comes to Natural Language Processing (NLP), it arose along the humans’ necessity of being understood by computers. In this paper we present an exploratory approach that aims at bringing together the best of both worlds. Our goal is to disco...

متن کامل

Prediction of mineral deposit model and identification of mineralization trend in depth using frequency domain of surface geochemical data in Dalli Cu-Au porphyry deposit

In this research work, the frequency domain (FD) of surface geochemical data was analyzed to decompose the complex geochemical patterns related to different depths of the mineral deposit. In order to predict the variation in mineralization in the depth and identify the deep geochemical anomalies and blind mineralization using the surface geochemical data for the Dalli Cu-Au porphyry deposit, a ...

متن کامل

An Empirical Evaluation of various Deep Learning Architectures for Bi-Sequence Classification Tasks

Several tasks in argumentation mining and debating, question-answering, and natural language inference involve classifying a sequence in the context of another sequence (referred as bisequence classification). For several single sequence classification tasks, the current state-ofthe-art approaches are based on recurrent and convolutional neural networks. On the other hand, for bi-sequence class...

متن کامل

Data Mining of Causal Relations from Text: Analysing Maritime Accident Investigation Reports

Text mining is a process of extracting information of interest from text. Such a method includes techniques from various areas such as Information Retrieval (IR), Natural Language Processing (NLP), and Information Extraction (IE). In this study, text mining methods are applied to extract causal relations from maritime accident investigation reports collected from the Marine Accident Investigati...

متن کامل

Identification of mineralization features and deep geochemical anomalies using a new FT-PCA approach

The analysis of geochemical data in frequency domain, as indicated in this research study, can provide new exploratory informationthat may not be exposed in spatial domain. To identify deep geochemical anomalies, sulfide zone and geochemical noises in Dalli Cu–Au porphyry deposit, a new approach based on coupling Fourier transform (FT) and principal component analysis (PCA) has beenused. The re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1712.01476  شماره 

صفحات  -

تاریخ انتشار 2017